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ABSTRACT 

We-  consider  examples  (1-D  seismic,  large  flexible  structures,  bioturbation, 
nonlinear  population  dispersal)  in  which  a  variational  setting  can  provide  a 
convenient  framework  for  convergence  and  stability  arguments  in  parameter 
estimation  problems. 
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I.  Introduction 


In  this  note  we  consider  one  aspect  (arguments  for  convergence  and 
stability  via  a  variational  approach)  of  least-squares  formulations  of  parameter 
estimation  problems  for  partial  differential  equations.  Conceptually,  one  has  a 
dynamical  model  with  "states"  u  =  u(t,x),  0  i  t  <  T,  x  «  O,  and  parameters 
q  =  q(t,x)  in  some  admissible  set  Q.  Given  observations  or  data,  z  e  Z, 

A 

of  some  type  (e.g.,  z  =  (u^)  as  observations  f  { u(tj,x j)}),  one  wishes  to 

determine  parameters  q  that  give  a  best  fit  of  the  model  to  the  data.  That 
is,  one  has  the  constrained  optimization  problem:  From  an  admissible 
parameter  set  Q,  choose  a  parameter  q  so  that  the  corresponding  solution  of 
the  dynamical  model  gives  the  best  fit  to  data  using  a  least-squares  fit 
criterion. 

Abstractly,  we  have  a  state  space  H  in  which  we  solve  a  dynamical 
system  (5)  for  parameter  dependent  solutions  u  =  u(q)  with  the  parameters 
chosen  from  some  infinite  dimensional  set  Q.  If  C:  H  -  Z  is  a  mapping 
from  the  state  space  to  the  observation  space  Z,  the  problem  is  one  of 
minimizing 

2 

(11)  J(q,z)  =  |Cu(q)  -  z  [ 

over  q  e  Q,  where  I  ■  I  ^  is  an  appropriately  chosen  norm  in  Z. 

The  fact  that  many  problems  of  interest  arc  infinite  dimensional  in  both 
state  spaces  H  and  parameter  sets  Q  leads  to  a  rich  class  of  mathematical 
questions  including  wcll-poscdncss,  stability,  and  computational  approaches.  For 


example,  consider  the  possibility  of  approximating  the  state  space  H  by  a 
sequence  HN  of  finite  dimensional  spaces  and  approximating  the  parameter 
set  Q  by  a  sequence  of  finite  dimensional  sets  so  as  to  obtain 

approximating  problems:  Minimize 

(1-2)  JN(q,z)  =  |CuN(q)  -  z|2 

Z 

over  q  e  QM,  where  uN  is  an  approximate  solution  to  (S)  lying  in  HN.  An 
important  question  concerns  the  ways  in  which  approximating  Q  and 

Hn  approximating  H  might  guarantee  convergence  of  solutions  qN,M  of 

the  problems  of  minimizing  JN  over  QM  to  a  solution  q  of  the  problem  of 
minimizing  J  over  Q.  A  number  of  results  [8],  [27]  in  this  area  are  available 
and  we  just  sketch  one  set  of  arguments  here  (for  examples  and  more  details, 
sec  [8]) 

Suppose  that  the  sets  Q  and  QM  lie  in  some  metric  space  Q  and  that, 
in  fact,  there  is  a  mapping  Q  -»  QM  so  that  <$*  =  iM(Q).  Further, 

assume  that  the  following  hypotheses  are  satisfied  by  and  HN: 

(i)  For  any  qk  -  q  in  Q  we  have  CuN(qk)  -*  Cu(q)  in  Z  as  N,k  -»  <*>; 

(ii)  For  each  N,  the  mapping  q  -»  JN(q,z)  is  continuous  in  the  Q 

topology; 

(iii)  The  sets  Q  and  QM,  for  each  M,  are  compact  in  the  Q  topology; 

( i v )  For  each  q  €  Q,  iM(q)  -•  q  in  Q  with  the  convergence  uniform  in 
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Undcr  these  assumptions,  let  qN,M  be  solutions  for  the  problems  for  (1.2)  and 
let  qN,M  t  Q  be  such  that  iM(qN,M)  =  qN,M.  From  the  compactness  of  Q. 
we  may  select  subsequences,  again  denoted  by  (qN,M)  and  {qN,M}  so  that  qN,M 
-*  q  €  Q  and  qN,M  -*  q  (the  latter  follows  from  (iv)).  The  optimality  of 
qN,M  guarantees  that  for  every  q  (  Q 

(1.3)  JN(qN'M,z)  <  JN(iM(q),z). 

Using  (i)  and  (iv)  and  taking  the  limit  as  N,M  -*  ®  in  this  inequality  yields 
J(q,z)  <  J(q,z)  for  every  q  t  Q,  or  that  q  is  a  solution  of  the  problem  for 
(1.1).  (Under  uniqueness  assumptions  on  the  problems,  one  can  actually 
guarantee  convergence  of  the  entire  sequence  in  place  of  subsequentia! 

convergence  to  solutions.) 

Wc  note  that  the  essential  aspects  in  the  arguments  sketched  here  involve 
compactness  assumptions  on  the  sets  QM  and  Q.  Such  compactness  ideas  play 
a  fundamental  role  in  other  theoretical  and  computational  aspects  of  these 
problems.  For  example,  one  can  formulate  distinct  concepts  of  problem 
stability  and  method  stability  involving  some  type  of  continuous  dependence  of 
solutions  on  the  observations  z  in  Z,  and  use  hypotheses  similar  to  fi)  -  (iv), 
with  compactness  again  playing  a  critical  role,  to  guarantee  stability.  Wc 
illustrate  with  a  simple  form  of  method  stability  (other  stronger  forms  arc  also 
amenable  to  this  approach). 

Wc  might  say  that  an  approximation  method,  such  as  that  formulated 
above  involving  QM,  HN  and  (1.2),  is  stable  if 
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dist  (qN’M(zk),q(z°))  -•  0 

as  N,M,k  -»  0  for  any  z*5  -»  z°  in  Z,  where  q(z)  denotes  the  set  of  all 

solutions  of  the  problem  for  (1.1)  and  qN,M(z)  denotes  the  set  of  all  solutions 
of  the  problem  for  (1.2).  Here  "dist"  represents  the  usual  distance  set  function. 
Under  hypotheses  (i)  -  (iv)  one  can  use  arguments  very  similar  to  those 
sketched  above  to  establish  that  one  has  this  method  stability.  If  the  sets 
Qm  arc  not  defined  through  a  mapping  iM  as  supposed  above,  one  can  still 
obtain  this  method  stability  if  one  replaces  (iv)  by  the  assumptions: 

(v)  If  (q^  is  any  sequence  with  qM  e  QM,  then  there  exists  q*  in  Q 
and  subsequence  (q^  with  q^  -»  q*  in  the  Q  topology; 

( v i )  For  any  q  c  Q,  there  exists  a  sequence  (qM)  with  q*^  e  QM 

such  that  q14-*  q  in  Q  . 

Similar  ideas  may  be  employed  to  discuss  the  question  of  problem 
stability  for  the  problem  of  minimizing  (1.1)  over  Q  -  i.c.  the  orginal 
problem  and  again  compactness  of  the  admissible  parameter  set  plays  a  critical 
role.  For  discussions  of  other  questions  related  to  problem  stability,  see  [19], 
[21]  -  and  specifically  Remark  5.1  of  [21], 

Compactness  of  parameter  sets  also  appears  to  play  an  important  role  in 
computational  considerations.  For  example,  in  certain  problems  the  formulation 
outlined  above  (involving  =  IM(Q))  results  in  a  computational  framework 

wherein  the  and  Q  all  lie  in  some  uniform  set  possessing  compactness 

properties.  The  compactness  criteria  can  then  be  reduced  to  uniform 
constraints  on  the  derivatives  of  the  admissible  parameter  functions.  Wc  have 
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numcrical  examples  which  show  that  imposition  of  these  constraints  is 
necessary  (and  sufficient)  for  convergence  of  the  resulting  algorithms.  (This 
offers  a  possible  explanation  for  some  of  the  numerical  failures  of  such 
methods  reported  in  the  engineering  literature  -e.g.  see  [37].). 

Thus  we  have  that  compactness  of  admissible  parameter  sets  play  a 
fundamental  role  in  a  number  of  aspects  -  both  theoretical  and  computational- 
in  parameter  estimation  problems.  This  compactness  may  be  assumed  (and 
imposed)  explicitly  as  we  have  outlined  here,  or  it  may  be  included  implicitly 
in  the  problem  formulation  through  Tychonov  regularization  as  recently 
discussed  by  Kravaris  and  Seinfeld  [25].  In  the  regularization  approach  one 

restricts  consideration  to  a  subset  Qj  of  parameters  which  has  compact 

imbedding  in  Q,  modifies  the  least-squares  criterion  to  include  a  term  which 
insures  that  minimizing  sequences  will  be  Q,  bounded  and  hence  compact  in 
the  original  parameter  set  Q. 

Having  made  a  case  for  the  role  that  compactness  of  admissible 

parameter  sets  might  play  in  parameter  estimation  problems,  we  turn  finally  to 
the  (not  unrelated)  focus  of  this  note.  In  particular,  we  wish  to  discuss  some 
problems  in  which  a  variational  formulation  (as  opposed  to  the  semigroup 

approximation  framework  we  have  used  in  many  of  our  previous  discussions  of 
these  problems  -  see  [3,4,5,7,12,13,17])  permits  relaxation  of  the  compactness 
criteria  needed  in  convergence,  stability  and/or  computational  analyses.  We 
present  several  problems  for  which  the  variational  framework  can  be  used  to 
give  convergence  arguments  in  the  spirit  of  techniques  commonly  used  in  the 
finite-clement  approach  (sec  [22]  and  the  references  therein)  to  initial-boundary 
value  problems  for  partial  differential  equations.  As  we  shall  sec  below,  the 
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"energy  functionals"  in  our  case  are  parameter  dependent  and  the  arguments 
can  become  somewhat  tedious  in  some  instances. 

In  the  next  two  sections  we  discuss  problems  for  which  the  variational 
approach  offers  an  alternative  to  the  semigroup  formulation.  However,  there 
are  some  problems  for  which  the  semigroup  approach  is  not  readily  employed 
but  for  which  a  variational  framework  is  rather  natural.  We  present  two 
su^h  examples  in  Sections  4  and  5. 

To  facilitate  our  discussions,  in  some  cases  we  restrict  our  remarks  to 
problems  in  which  we  minimize  J  and  JN  of  (1.1)  and  (1.2)  over  a  fixed  set 
Q.  relegating  the  role  that  approximating  sets  play  to  comments  and 

referring  the  reader  to  [8]  for  an  explanation  of  how  one  readily  extends  the 
ideas  to  problems  of  minimizing  JN  over  QM  where  QM  approximates  Q. 


II.  A  "1-D  Seismic"  Inverse  Problem 


We  consider  the  system 


:2.i) 


d2u  d  du 

q,(x)  — 2  =  —  (q,(x)— ) 

1  at  ax  2  ax 


t  >  o,  x  e  n  =  (0,1) 


[2.2) 


- (t,0)  +  q  u(t,0)  =  f(t) 

ax  3 


3u  du 

[2.3)  - (t,l)  +  q 4  - (t,l)  =  0 

at  4  ax 

(2.4)  u(0,  - )  =  u0  ,  ut(0,  • )  =  vQ 

and  the  associated  inverse  problem  of  estimating  qp  q2,  q3.  q4,  f,  given  a 

set  of  observations  {yv}  for  (u(t.x.)}. 

Such  problems  arc  motivated  by  certain  versions  of  the  so-called  "I-D 
Seismic  Inversion  Problem"  (see,  c.g.  [2],  [18]).  Roughly  speaking,  one  has  an 
clastic  medium  (c.g.,  the  earth)  with  density  qt  and  clastic  modulus  q  A 
perturbation  of  the  system  (explosions,  or  vibrating  loads  from  specially 
designed  trucks)  near  the  surface  (x=0)  produces  a  source  f  for  particle 
disturbances  u  that  travel  as  clastic  waves,  being  partially  reflected  due  to 


the  inhomogeneous  nature  of  the  medium.  An  important  but  difficult  problem 
involves  using  the  observed  disturbances  at  the  surface  or  at  points  along  a 
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"bore  hole"  to  determine  properties  (represented  by  parameters  in  the  system)  of 
the  medium.  In  the  highly  idealized  1-D  "surface  seismic"  problem,  one 
assumes  that  data  are  collected  at  the  same  point  (x=0)  where  the  original 
disturbance  or  "source"  is  located.  In  addition  to  this  hypothesis,  other 
unrealistic  special  assumptions  are  made  about  the  nature  of  the  traveling  and 
reflected  waves.  Although  the  standard  1-D  formulations  are  far  from  reality, 
exploration  seismologists  have  developed  techniques  for  processing  actual  field 
data  (performing  a  series  of  experiments  and  "stacking"  the  data)  so  that  the 
1-D  problems  are  generally  accepted  as  useful  and  worthy  subjects  of 
investigation.  Consequently,  numerous  papers  (for  some  interesting  references, 
sec  the  bibliographies  of  [2],  [18])  on  the  1-D  problems  can  be  found  in  the 
research  literature. 

In  many  formulations  of  the  seismic  inverse  problem,  the  medium  is 
assumed  to  be  the  half-line  x  >  0  (with  x  =  0  the  surface)  while  in  others 
(especially  some  of  those  dealing  with  computational  schemes)  one  finds  the 
assumption  of  an  artificial  finite  boundary  (say  at  x  =  1)  at  which  no 
downgoing  waves  are  reflected  (  an  "absorbing"  boundary).  For  the  1-D 
formulation  this  condition  is  embodied  in  a  simple  boundary  condition  of  the 
form  (2.3);  here  q4  -  ✓qj.fi )  /q1(i )  and  one  can  view  this  boundary  condition 
as  resulting  from  factoring  the  wave  equation  (2.1)  at  x  =  1  and  imposing 
the  condition  of  "no  upgoing  waves"  at  x  =  1. 

Equation  (2.1)  is  a  1-D  version  of  the  equations  for  an  isotropic  clastic 
medium  while  (2.2)  represents  an  elastic  boundary  condition  at  the  surface  x 
=  0  (q3  represents  an  elastic  modulus  for  the  restoring  force  produced  by  the 

medium). 
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In  the  usual  seismic  experiment,  the  medium  is  assumed  initially  at  rest 
so  that  Uq  =  v0  =  0.  While  our  analysis  below  can,  with  some  tedium,  be 
extended  to  treat  the  case  of  parameter  dependent  initial  conditions,  we  shall 
assume  that  u0,  vQ  are  given  Known  functions. 

For  our  discussions  here,  we  shall  also  assume  that  the  source  term  f  is 
to  be  estimated  in  some  function  space  although  frequently  this  term  can  be 
parameterized  in  terms  of  a  Euclidean  set  of  parameters,  thus  simplifying 
somewhat  the  analysis.  We  shall  also  assume  that  qt(x)  =  1  to  facilitate 
our  arguments  (otherwise  the  analysis  is  somewhat  more  tedious  and  involves 
the  use  of  parameter  dependent  inner  products). 

We  reformulate  the  system  (2.1)-(2.4)  in  variational  or  weak  form,  seeking 
a  solution  t  -*  u(t)  on  0  <  t  <  T  ,  with  u(t)  e  H^O)  ,  satisfying 

(2.5)  <utt,  4»>  +  <q2Du,D4,>  +  ut(t,l)qXl)  -  q2(0)  [q3u(t,0)  -  f(t)]^0)  =  0 

^4 

for  all  €  H1(fl),  along  with  initial  conditions 


(2.6) 


u(0)  =  u0  ,  ut(0)  =  vQ  . 


Here  and  throughout,  unless  otherwise  noted,  <  ,  >  denotes  the  usual  inner 
product  in  H°  =  L2  and  D  =  The  parameters  q  =  (q2,  q3,  q4,  f)  arc 

assumed  to  be  in  some  subset  of  C(n)  x  R1  x  R1  x  C(0,T)  ,  although  as  wc 
shall  point  out  later,  these  smoothness  requirements  can  be  relaxed. 


The  system  (2.5)  -  (2.6)  can,  for  the  purposes  of  analysis,  be  abstractly 
formulated  using  the  state  space  H  =  R2  x  H°(0)  for  states  u(t)  =  (u(t,0). 


u(t,l),  u(t,-)).  To  be  precise,  define 


V  =  ^  „,*,+)  £  H  :  n  =  +(0),  *  =  »tfl),  *  «  H1^)} 


and,  for  v  =  (n,4,v)  in  H  ,  the  operators  M0:  H  -*  H,  N0:  H  -•  H  by 


M0C  =  (0,0,v)  No^  =  <°>  ^  4,0)- 


We  also  define  the  functionals  a:  V  x  V  -»  R1,  b(t):  V  -»  R1  by 


a(v,  *)  =  <q2Dv,Dv|*>  -  q2(0)q3v(0)+(0) 
b(t)4»  =  q2(0)f(t)+(0)  . 

Then  we  can  rewrite  (2.5)  -  (2.6)  as 

(2.7)  <M0utt,  +  <NQut,v}<>|^  +  a(u(t),^)  +  b(t)4»  =  0 

where  u(t),  |  (  V,  We  note  that  in  this  case  the  operators  N0.  b(t) 
the  functional  a  each  depend  on  unknown  parameters. 


and 
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Standard  arguments  [28,  p.273]  can  be  used  to  guarantee  existence  of 
solutions  u  to  (2.7),  (2.6)  satisfying  u  £  C(0, T;V),  ut  e  C(0,T;H),  uu  e 
H°(0,T;V')  with  (2.7)  being  satisfied  in  a  weak  or  distributional  sense. 

Furthermore,  one  can  rewrite  (2.7)  as  a  first  order  system  and  use  semigroup 
techniques  to  argue  that,  under  additional  smoothness  assumptions  on  the 
parameters  and  initial  data,  one  obtains  strong  solutions  that  enjoy  additional 
smoothness  properties.  We  shall  return  to  these  considerations  below. 

Turning  to  approximation  and  convergence  arguments,  we  shall  work  with 
our  system  in  the  form  (2.5),  (2.6)  although  we  could  equivalently  use  (2.7)  in 
our  considerations.  We  consider  Galerkin  approximations  on  finite 
dimensional  subspaccs  H‘N  <=  H^ft),  N  =  1,2,...,  and  make  the  standing 

assumption  on  the  orthogonal  projections  PN:  H°(n)  -»  HN. 

Assumption  A:  If  <t>  e  H°(0),  then  PN«t»  -»  <t>  in  H°.  For  each  <t>  £  H*(n)  , 

we  have  -•  <t>  in  H1. 

If  the  observations  for  (2.1)  -  (2.4)  for  use  in  the  least-squares  functional 
are  given  in  pointwise  form  u(ti,Xj)  for  the  displacement  or  in  an  H°  sense 
ux(t.,.)  for  the  strain,  it  suffices  for  the  convergence  and  stability  arguments 
to  argue  that  uN(t;qN)  -»  u(t;q)  in  H!(fl)  whenever  qN  ->  q  in  an 

appropriate  sense,  where  uN  is  the  Galerkin  approximation  to  the  original 

system  (2.5),  (2.6). 

For  parameters  qN  =  (q£  qg,  q*,  fN)  in  an  admissible  parameter  set 
Q  ,  the  approximating  systems  arc  given  by:  Find  uN(t)  e  Hn  satisfying  for 
all  vj;  €  Hn 
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(2.8)  <u"  *>  + 


<q? 


Dun,D^>  + 


q?0) 


u"(t,l)*(l)  -  qN(0)[qNuN(,i0)  -  fN(t)]^0)=0, 


(2.9) 


uN(0)  =  PNu0 
u**(0)  =  PNv0  . 


Regarding  the  admissible  parameter  set  Q  we  make  the  standing 

assumptions: 

Assumption  B:  The  set  Q  is  compact  in  the  Q  =  C(n)  x  R1  x  R1  x 
H1(0,T)  topology  and  is  contained  in  the  set 

{(q2,  q3,  q4,  f)  €  Q  I  q2(x)  »  v  >  0,  q3  <  -  n  <  0,  0  <  0  <  q4  $  a] 

for  some  fixed  positive  constants  a,0,v,n. 

Suppose  then  that  qN  -•  q  in  Q  ,  where  (qN)  is  an v  convergent 
sequence  in  Q,  and  let  uN(qN),  u(q)  denote  the  corresponding  solutions  to 
(2.8),  (2.5)  respectively.  Under  Assumption  A,  we  sec  from  the  inequality 

|uN(qN)  -  u(q)|4  «  |uN(qN)  -  PNu(q)  [  +  |pN  u(q)  -  u(q)  |t 


< 

1 


i 
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that  it  suffices  to  consider  z^t)  =  uN(t;qN)  -  PNu(t;q)  and  argue  the 
convergence  z^(t)  -»  0  in  H1((l)  for  each  t  in  [0,T]. 

Defining  the  "potential  energy"  functional  4>  :  H1  x  H1  -  R1  by 

(2.10)  <l>(q)(<t>,4>)  =  <q2De>,D4<>  -  Q2(0)q3<t>(0)<^(0) 


and  the  "boundary  damping"  functional  B  :  H1  x  H1  -  R1 


(2.11) 


q2d) 

B(q)(<t>,4<)  h  -  «J>(  1  )4>(  1 )  i 

<14 


we  may  use  (2.5)  (with  q  =  q)  and  (2.8)  in 

<z  ,**<>  =  <(uN  -  u  +  u  -  PNu)tt,*> 

to  obtain 

<z^t,*j/>  +  «<qN)(zN,*)  +  B(qN)(zNt  ,t) 

(2.12)  =  *(q)(u,*)  +  B(q)(ut,v|/)  -  <*qN)(PNu,*)  -  B(qN)(PNur*l 

+  <(I  -  PN)utt,*>  +  [q^O)f(t)  -  q2N(0)fN'(t)]4<0) 


-14- 


for  all  e  Hn.  In  addition  to  this  equation,  2^  satisfies  the  initial 
conditions  (see  (2.6)  and  (2.9)) 


(2.13) 


zN(0)  =  0  ,  z*(0)  =  0. 


Choosing  4;  =  z^1  (which  is  in  HN)  in  (2.12)  and  defining 


(2.14) 


A<tfu)  =  <Kq)(u,zN)  -  <t<qN)(PNu,zN) 


(2.15) 


8?(t)  =  q,  (0)f(t)  -  q?(0)fN(t)  , 


we  obtain  from  (2.12)  the  equation 


|  {1 |*  ♦  «qN)<zN,z")}  +  B(qN)(z^,z[)  ) 


(2.16) 


=  —  {A4<u)}  -  A<Kut)  +  65  (t)  z^(t.O) 


+  B(q)  (ut,z**  )  -  B(qN)(PNut.zNt)  +  <(I-PN)utt.ztN’>. 


If  we  further  define  the  total  energy  functional 
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E(qN)(zN)  =  z*  2  +  <KqN)(zN,zN) 


and  the  auxilary  expressions  (for  notational  convenience) 


(2.17) 


q2d) 


q?0> 

B?(t)  =  -  ut(t,l) - PNut(t,l) 

q4  v,  - 


(2.18) 


(2.19) 


6?(t)  =  q?(0)q?  PNu(t,0)  -  q,(0)qsu(t,0) 


AN(t)  =  q2Du(t)  -  q2  DPiNu(t), 


then  we  can  rewrite  (2.16)  as 


i  d 


7  —  E(qN)(zN(t))  +  B(qN)(zN,zN) 


(2.20) 


d  A<tfu)  -  A4Kut)  +  (5*(t)  zN(t,0)}  -  8^(t)zN(t,0) 


dt 


+  6*(t)zf(t,I)  +  <(I  -  Pw)utt,zf>. 


N\„  ,N. 


We  next  observe  that  from  Assumption  B  we  have 


B(qN)(zN.zf  )  »  |z^  (t,l)  | 2 
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while  the  inequality  be  <  +  4c2  implies,  with  a  proper  choice  of 

constants. 


6?(t)z?(t,l)  S 


2 


Using  these  inequalities  in  (2.20)  we  obtain 

^  {y  E(qN)(zN(t))  -  A*(u)  -  6g(t)zN(t,0)} 

(2.21) 

<  -  A<tfut)  +  ~  1 t)  | 2  -s£(t)zN(t,0)  +  <(I  -  P>tt,z*>. 

Integrating  this  expression  and  using  the  facts  that  A$(u)|t_  0  =  0  ,  zN(0)  =  0  , 
and  E(qN)(0)  =  0  ,  we  thus  find 

(2.22)  E(qN)(zN(t))  <  2a«(u)  +  26*(t)zN(t,0)  +  2PGN(t)dt 

■>0 


where  GN  is  defined  as  the  function  on  the  right  side  of  the  inequality 
(2.21).  Finally,  to  make  use  of  this  bound  on  the  parameter-dependent  energy, 
we  employ  the  following  set  of  inequalities: 


2A<Xu)(t)  «  ^  AN(t)  +^DzN(t)|  (t)  +  —  |zN(t,0) 


28*(t)zN(t,0)  «  4vw|&3(t)|2  +  |  zN(t,0)  |2  , 

-2A4>(u  t)(  0  «  ^  |  A^(  t )  | 2  +  ?|DzNU)|2+  ^7fr|  6^.(4)  | 2  +  i^|zN(^0)|2, 
-28^(OzN(t,0)  <  ^-|63Nt(0|2  +  ^t|zNU,0)|2, 

2<(I  -  P>tt,  z?>  <  |(I  -  P>u|2  +  |ZN|2  , 

E(qN)(zN(t))  >  |zN(t)|2  +  v  |  DzN(t)  | 2  +  vn|zN'(t,0)|2  . 


Then  (2.22)  may  be  replaced  by 


(2.23) 


where 


(2.24) 


and 


(2.25) 


e  (zN(t))  «  rN(t)  + 


E(zNU))dt 

0 


e(zn(0)  =  |z?(0  |2  +  j  |dzn(o|*  +  j-|zN(^.0) |  2 
rN(t)  =  —  I AN(t)  1 2  +  - — 1 8^(t)  I  2  +  - — 1 6g(t)  | 2  +  fT  F(Od( 

v  1  '  vn  vn  Jq 


with 
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(226)  F({)  =  i  k«)|S  ^|s”<t)|!+  4ls"(»r 

+  la  -  p-voi’  +  wi*«r- 

Thus,  from  (2.23)  and  the  Gronwall  inequality,  to  establish  that 
E(zN(t))  -»  0  (and  hence  that  z^t)  -»  0  in  H1(ft)),  it  suffices  to  argue  that 
rN(t)  -»  0  for  each  t  in  (0,T],  In  view  of  the  definitions  (2.15),  (2.17)  - 
(2.19),  this  convergence  is  easily  argued  under  the  Assumptions  A,  B  (where 
qN  -•  q  in  the  Q  topology)  and  that  u(t)  e  H!(n),  ut(t)  e  HJ(n),  uu  e 
H°([0,T]  x  0).  To  complete  the  discussions  of  this  section,  we  shall  comment 
on  a  number  aspects  of  the  above-outlined  results  in  the  form  of  several 
remarks. 

Remark  2.1.  Recalling  Assumption  B,  we  note  that  the  formulation  above 

requires  the  elastic  moduli  q2  lie  in  C(n)  and  that  the  compactness 

properties  of  Q  with  respect  to  this  component  be  in  the  C(0)  sense. 
These  arc  readily  weakened  by  formulating  the  problem  with  the  q2 
component  replaced  by  (q2(0),  q2(l),  q2)  in  R1  x  R1  x  L“(n)  -  see  equation 
(2.5)  -  with  a  corresponding  change  in  the  Q  topology  employed  for  the 
compactness  statement  in  Assumption  B.  This  is  especially  useful  if  one  wishes 
to  consider  discontinuous  elastic  moduli  (an  important  formulation  in 
"multi-layered"  seismic  problems)  and  the  ease  with  which  such  modifications 
are  treated  in  the  variational  framework  above  make  it  ideally  suited  for 
convergence  arguments  when  estimating  discontinuous  parameters. 


Remark  2.2.  In  cases  where  the  initial  functions  Uq,  vq  of  (2.6)  also  depend 
on  unknown  parameters,  i.c.,  Uq  =  uQ(q),  vQ  =  vQ(q) ,  then  the  initial  conditions 
(2.13)  must  be  replaced  by  z^O)  =  PN[u0(qN)  -  u0(q)]  ,  z^(0)  =  PN[v0(qN)  - 
vQ(q)].  The  convergence  arguments  can  be  extended  to  this  case  if  one  makes 
appropriate  smoothness  assumptions  on  Uq,  vq  as  functions  of  the  parameters 

q- 

Remark  2.3.  Assumption  A,  the  fundamental  approximation  hypotheses  on 
Hn  ,  is  readily  shown  to  hold  if  one  chooses  either  the  standard  piecewise 
linear  or  cubic  splines  as  approximation  elements  and  hence  these 
approximation  schemes  arc  included  as  special  cases  in  the  above  treatment. 
The  arguments  and  assumptions  must  be  modified  slightly  if  one  wishes  to 
include  spectral  families  such  as  Legendre  polynomials.  (One  obtains  the 
convergence  P1^  -*  <t>  in  H1  for  <t>  e  H1+c  and  Assumption  A  must  be 
modified  accordingly.  This,  in  turn,  requires  that  u(t),ut(t)  be  in  H1+e  in 
order  to  carry  out  the  convergence  arguments  above.) 

Remark  2.4.  The  presentation  here  assumes  that  one  is  performing  the 
optimization  in  the  least-squares  fit-to-data  over  the  admissible  parameter  set 
Q.  In  general,  this  is  an  infinite  dimensional  function  space  in  the  com¬ 
ponents  q2  and  f.  Thus  one  requires,  as  explained  in  Section  1,  a  second 
approximation  family  QM  and  a  double  limit  procedure.  Recalling  our 
discussions  from  Section  1,  we  note  that  in  the  problems  considered  in  this 
section  it  suffices  to  use  the  set  Q  =  C(fl)  x  R1  x  R1  x  H*(n)  in  the  com¬ 
pactness  and  approximation  statements  involving  QM  and  Q. 
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Remark  2.5.  In  the  presentation  above,  we  found  that  to  guarantee  the  desired 
convergence,  it  suffices  to  have  u(t)  and  ut(t)  in  H‘(n)  and  utt  e  H°([0,T]xn) 
where  u  is  the  solution  of  (2.5),  (2.6)  corresponding  to  any  limit  parameters 
q  in  Q.  Without  further  smoothness  assumptions  on  the  problem  data  one 
cannot  readily  guarantee  this  desired  regularity  for  u.  For  example,  if  q2 
€  L“(fl),  u0  e  H!(n),  vQ  e  H°(0)  and  f  e  H°(0,T)  ,  as  we  have  already  observed 
one  can  use  variational  theory  to  guarantee  existence  of  weak  solutions  with 
utt  e  H°(0,T ;  H'1)-  If,  however,  we  assume  q2  e  H1,  uQ  e  H2,  vQ  e  H1  and  f 
e  C1  ,  then  semigroup  arguments  similar  to  those  given  in  [15]  can  be  made  to 
obtain  strong  solutions  with  u(t)  £  H2,  ut(t)  e  H1,  and  uu  £  H°([0,T]xn).  Thus. 
q2  in  H1  and  sufficient  smoothness  on  u0,  vQ,  f  will  yield  the  desired 
smoothness  for  convergence. 

Remark _ 2ji.  The  problem  considered  in  this  section  was  also  investigated  in 

[13]  using  a  semigroup  (Trottcr-Kato  approximation  theorem)  approach.  Several 
differences  in  the  results  arc  noteworthy.  First,  we  note  that  in  the 
variational  framework  above,  the  approximating  (HN  cz  H^n))  basis  elements 
arc  not  required  to  satisfy  parameter  dependent  boundary  conditions  (contrarv 
to  the  situation  in  [13]).  Furthermore,  the  unknown  source  term  f  in  the 
boundary  condition  (2.2)  can  be  treated  directly  here  (in  [13],  the  treatment 
required  a  transformation  to  a  system  with  homogeneous  boundarv  conditions 
and  nonhomogcncous  equation)  and  this  relaxes  the  smoothness  and  compactness 
assumptions  needed  on  the  f  component  of  Q.  (A  similar  relaxation  can 
be  obtained  in  the  framework  of  [13]  using  a  slightly  different  method  for 
transforming  the  nonhomogeneity  in  the  boundary  condition  to  the  system 
equation.)  In  this  regard  we  also  note  that  the  approach  in  [13]  requires 
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convcrgcnce  -»  q2  in  H*(n)  as  opposed  to  in  C(fl)  (or  R1  x  R1  x  L“(0)  - 
see  Remark  2.1).  Thus  the  related  compactness  assumption  on  the  qn 
component  Q2  of  Q  is  relaxed  from  H1  to  C  compactness.  This  is 

potentially  important  in  several  respects.  The  characterization  of  compactness  in 
C(O)  is  somewhat  more  natural  (e.g.  the  Arzela-Ascoli  lemma)  than  that  in  H1 
(e.g.  see  the  embedding  lemmas  in  [1]).  Furthermore,  in  extending  the  ideas 
here  or  in  [13]  to  treat  estimation  of  discontinuous  coefficients,  compactness  in 
H1  (or  a  piecewise  H1  compactness)  is  more  awkward  and  tedious  to 
formulate  than  a  concept  of  piecewise  C  or  Lm  compactness.  Finally,  we  note 
that  the  mode  of  convergence  required  in  the  compactness  of  Q  also  dictates 
the  type  of  approximating  families  Q^1  one  can  use.  We  recall  that  the 

interpolation  operators  IM  (see  [13]  [32])  for  both  piecewise  linear  and  cubic 
spline  approximations  satisfy  IM(q)  -»  q  uniformly  in  q  e  Q  in  either  C 
or  H1  whenever  Q  c:  H2.  However  there  are  occasion  where  one  might 
desire  to  use  the  weaker  convergence  requirement  (e.g.  when  dealing  with 
discontinuous  coefficients  and  a  piecewise  C  topology),  in  which  the 

variational  formulation  of  the  above  presentation  can  prove  advantageous. 
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III.  Large  Flexible  Structures 

In  one  important  class  of  parameter  estimation  problems  (sec 

[5,20,24,29,35]),  one  wishes  to  estimate  structural  parameters  (stiffness,  damping, 
loading,  etc.)  in  complex  continuum  models  for  elastic  structures.  The  methods 
discussed  in  this  paper  can  be  successfully  applied  to  such  problems.  To 
explain  the  approach,  we  consider  a  variable  structure  cantilevered  damped 
beam  with  a  tip  body  and  base  acceleration.  Such  a  model  might  be  used  for 
example  to  describe  the  vibrations  of  shuttle  attached  payloads  or  large 

flexible  spacecraft  members.  To  be  more  specific  we  consider  an 

Euler-Bernoulli  beam  of  length  l  with  viscoelastic  damping  (a  Kelvin-Voight 

solid)  and  tip  mass  m  (instead  of  a  tip  body).  Then  the  equations  for 

transverse  (planar)  vibrations  in  the  presence  of  an  axial  force  due  to  base 

acceleration  are  given  by 


a2u  a2 
at2  +  ax3 


[Vtt  - 


r  a  u  a-r 

r2  7T2  +  q3  77^ 


rQ  _?lu 

a3u 

•  ^  3 

v*2  ax2 

**  ^3  2 

*ax2at 

ax  ' 

-  ax 

t  >  0, 

X 

V 

o 

a2u 

a3u  . 

au 

1 1 

2  aT2  + 

ax2at^ 

+  o  — 
ax 

j  L=i 

=  g(t) 


u(t,0)  =  —  (t,0)  =  0 

ax 


u(0,  • )  =  <t>  ,  ut(0,  • )  =  + 


<*•  «*  «"•  -*•  .*•  .*•  «*•  .*•  .** 
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Here  q2  =  p(x)  is  the  linear  mass  density,  q2  =  EI(x)  is  the  flexural 
rigidity  or  stiffness,  q3  =  cDl(x)  is  the  viscoelastic  damping  coefficient,  q4 
=  m  is  the  tip  mass,  f  is  a  distributed  lateral  load,  and  g  is  a 
transversely  applied  force  on  the  tip  mass.  The  internal  tension  a  due  to 
axial  loading  from  base  acceleration  is  assumed  to  be  a  known  function  of  qj 
=  p,  q4  =  m  ,  and  q5  =  a0  =  the  base  acceleration.  From  observations  of  the 
beam  (displacement,  velocity  or  strain),  one  might  desire  to  estimate  the 
parameters  q  =  (qr  q2,  q3,  q4,  q5)  =  (p,EI,cDI,m,a0)  from  a  specified  para¬ 
meter  set  Q. 

For  such  problems  one  can  use  a  semigroup-Trotter-Kato  approximation 
formulation  to  develop  computational  procedures  -  e.g.  see  [5,26,31].  However, 
some  advantages  are  obtained  in  using  a  variational  formulation  (with  "state" 
u(t)  =  (u(t,  •),  u(t,!))  )  similar  to  that  in  (2.5),  (2.6)  or  (2.7).  This  is  done  in 
[6]  and  [15],  [16]  where  detailed  arguments  for  convergence  are  given.  We  shall 
not  discuss  them  fu  thcr  here,  except  to  note  that  the  variational  approach 
allows  for  a  much  weaker  compactness  criterion  on  the  admissible  parameter 
set  Q.  For  example  one  can  hypothesize  compactness  in  the  C(0,l)  norm 
(or  in  the  L°°(0,l)  norm  in  a  sense  similar  tc  that  mentioned  in  Remark  2.1 
above)  with  respect  to  the  components  representing  El  and  cDI.  (Compare  this 
with  the  H2  or  H2weak  compactness  assumptions  in  [5]  and  [26].) 
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IV.  Bioturbation  in  Abyssal  Sediments 

In  the  previous  two  sections,  we  introduced  problems  (seismic  and  flexible 
structures)  for  which  one  can  investigate  parameter  estimation  techniques  using 
either  a  semigroup  formulation  or  a  variational  formulation.  In  this  section 
and  the  next,  we  mention  briefly  two  other  classes  of  problems  which  are  not 
readily  treated  with  a  semigroup  formulation  yet  which  can  easily  be  analyzed 
in  a  variational  setting.  We  first  turn  to  problems  related  to  the  estimation  of 
the  effects  of  biological  mixing  in  abyssal  sediments. 

Sediment  formation  in  lakes  and  deep  seas  is  of  great  importance  to 
geophysical  scientists  who  use  core  samples  of  this  sediment  in  their 
investigations  of  the  history  of  the  earth.  Unfortunately,  the  historical  records 
contained  in  these  core  samples  are  often  perturbed  by  a  phenomenon  called 
bioturbation  [36]  which  is  the  mixing  of  sediments  due  to  the  activities  of 
organisms  near  (on  the  order  of  20-40  cm.)  the  sediment-water  interface. 
These  activities  consist  primarily  of  burrowing  (e.g.,  for  safety)  and 
ingestion-excretion  and  are  not  easily  described  quantitatively. 

An  important  goal  of  some  geologists  is  to  understand  (quantitatively) 
bioturbation  well  enough  so  as  to  enable  one  to  remove  its  effects  and 
properly  interpret  the  data  in  core  samples,  thereby  sharpening  the  details  in 
these  geologic  records.  A  number  of  increasingly  sophisticated  mathematical 
models  have  been  proposed  and  a  brief  review  of  a  number  of  these  is  given 
in  [23].  One  model  of  interest  is  the  one  proposed  by  Guinasso  and  Schink  in 
[23]. 


Briefly,  the  model  involves  one-dimensional  (depth)  transport  equations 
for  a  moving  chamber  (assumed  uniform  in  horizontal  directions)  in  which 
mixing  and  advective  flow  of  material  takes  place.  Depth  in  the  chamber  is 
represented  by  coordinates  x,  0  i  x  <  L,  and  the  chamber  (and  hence  co¬ 
ordinate  system)  is  assumed  to  be  moving  upward  with  a  velocity  q2  =  q2(t) 
[corresponding  to  sedimentation  rate  or  build-up]  so  that  it  is  always  located  in 
the  top  L  cm.  of  the  sediment,  i.e.,  x  =  0  is  always  at  the 

water-sediment  interface.  The  bottom  of  the  chamber  x  =  L  is  located  at 
that  depth  beyond  which  (it  is  assumed)  no  further  changes  (i.e.  no 

bioturbation)  in  the  historical  records  occur.  If  u  =  u(t,x)  is  the 

concentration  of  material  (e.g.,  shards  of  ash,  tracer,  etc.)  with  whose  movement 
one  is  concerned,  a  model  based  on  mass  balance  in  the  chamber,  Fickian  flux 
for  the  bioturbation,  and  appropriate  boundary  flux  considerations  is  given  by 


(4.1) 


t  >  0,  0  <  x  <  L  , 


(4.2) 


-q,(0)  -(t,0)  +  q2(t)u(t,0)  =  0 


(4.3) 


„  au 

-qi(L)  —  (t,L) 

x  ax 


o  . 


Here  qj  is  a  depth  dependent  "bioturbation"  coefficient. 

To  understand  the  effects  of  bioturbation  on  the  distribution  of 


material  concentrations  in  core  samples,  it  is  sufficient  then  to  know  the 
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parameters  q  =  (q1,q2,q3)  ,  q3  =  L  and,  of  course,  know  that  use  of  these 
parameters  in  the  model  gives  one  an  accurate  quantitative  description  of 

concentrations  found  in  core  samples.  Given  observations  from  core  samples, 
this  leads  to  a  parameter  estimation  problem  involving  estimation  of  the 

functions  qt  and  q2  and  the  chamber  length  q3.  In  [17],  it  is  shown  how 
to  formulate  and  treat  such  problems  in  a  discrete  semigroup  -  Trotter-Kato 
approximation  framework  if  one  assumes  that  qt  is  chosen  from  a  class  of 
functions  with  finite  dimensional  parametric  representation  and  q2  is 

independent  of  time.  If  one  wishes  to  treat  more  general  problems  of 
estimating  x  -♦  qj(x)  and  t  -»  q2(t)  in  general  classes  of  functions,  these  arc 
not  so  easily  investigated  using  a  semigroup  setting  (note  that  (4.2)  involves 
time  dependent  unknowns).  However,  a  variational  formulation  not  unlike  that 
given  in  [10]  provides  an  amenable  framework  in  which  convergence 
arguments  can  be  given  under  rather  weak  compactness  assumptions  on  the 

admissible  parameter  sets.  These  arguments,  to  be  given  elsewhere,  are  similar 
in  spirit  to  those  given  in  [10]  for  transport  problems  involving  estimation  of 
spatially  and  temporally  dependent  coefficients. 
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V.  Nonlinear  Population  Dispersal 

In  this  section  we  turn  to  a  brief  discussion  of  estimation  and  a 
variational  formulation  for  problems  that  are  typical  of  population  dispersal 
problems  with  transport  coefficients  (such  as  "diffusion"  coefficients)  that  arc 
density  dependent.  Nonlinearities  of  the  general  type  we  consider  here  arc 
also  important  in  porous  media  estimation  problems. 

Among  the  fundamental  mechanisms  often  of  interest  to  investigators  in 
population  dispersal  (see  [9,11,14,30,33,34])  are  (in  addition  to  the  usual 
emigration-immigration,  birth-death  mechanisms):  a  dispersive  mechanism 

associated  with  random  movement  or  foraging;  an  attractive  or  repulsive  force 
which  induces  directed  movement  of  population  members  toward  favorable  or 
away  from  unfavorable  environmental  surroundings;  and  a  mechanism 
representing  population  pressure  due  to  interference  between  individuals  in  the 
population.  In  mathematical  models  for  transport  including  such  mechanisms, 
it  is  density  dependent  higher  order  terms  that  present  difficulties  in 
theoretical  (and  computational)  considerations.  To  illustrate  how  a  variational 
framework  may  be  used  for  such  problems,  we  shall  sketch  fundamental 
convergence  arguments  for  problems  involving  estimation  of  the  parameter 
function  q  in  simple  models  of  the  form 

du  a  r  3u  > 

(5.1)  —  =  —  I  q(t,x,u) —  ]  t  >  0,  x  e  n  =  (0.1). 

at  ax  ax  } 


(5.2) 


u(t.O)  =  u(t,l)  =  0 
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u(0,  •)  =  u0. 

It’s  not  difficult  to  extend  the  ideas  to  many  of  the  more  detailed  models 
(which  include  other  desirable,  but  more  easily  treated  mathematically, 
transport  terms)  studied  in  [9,14,30,33,34],  While  we  shall  formulate  the 
problem  here  in  terms  of  estimating  rather  general  "diffusion"  coefficients  q 
in  (5.1)  from  a  rather  broadly  defined  class  Q  ,  we  actually  have  as  basic 
motivation  the  treatment  of  coefficients  that  are  bounded  below  bv 
density-independent  base  values,  that  are  saturation  limited  with  rates  that  arc 
affine  as  a  function  of  density  in  the  range  between  the  base-value  and 
saturation  thresholds.  To  be  precise,  our  development  has  been  motivated  by 
problems  where  $  -*  q(t,x,0  is  continuous  and  has  the  form  (see  [14]) 


(5.4) 


q(t,x,0  = 


m(t,x)  t  <  !^(t,x) 

a(t,x)  +  0(t,xH  C0(l’x)  <  t  <  l4(t,x) 
M(t,x)  tt(t,x)  <  *  . 


In  such  problems  we  seek  to  estimate  0,0,^,!^,  (which  determines  q  in 
(5.4)  if  the  continuity  assumption  is  invoked)  from  sets  A,B,r0.r1  respectively. 
We  shall  sketch  our  ideas  in  terms  of  rather  general  conditions  on  the 
parameter  set  Q  ,  noting  that  under  appropriate  assumptions  on  the  sets 
A,B,r0,rj  ,  the  above  example  is  included  as  a  special  case. 


We  first  rewrite  (5.1)  -  (5.3)  in  variational  form,  which  consists  of 


finding  u(t)  e  H^(n)  satisfying 


(5.5)  <ut,4/>  +  <q(t,  -  ,u)Du,D4»>  =  0 

for  all  4»  e  Hj(fl)  along  with  initial  conditions 

(5.6)  u(0)  =  u0  . 

To  define  an  approximate  state  system,  we  assume  we  have  chosen  a  family  of 
finite-dimensional  state  spaces  HN  c  Hj(fl)  with  orthogonal  projections  PN  : 
Hj(fl)  -»  Hn  in  the  H°(n)  inner  product.  We  also  assume  Assumption  A  of 
Section  2  holds  with  H1  replaced  by  H^.  The  approximate  systems  arc  then 
given  by  seeking  uN(t)  e  HN  satisfying 

(5.7)  <u[*,4/>  +  <qN(t,.,uN)DuN,D»j/>  =  0 
for  all  e  Hn  and 


(5.8)  uN(0)  =  PNu0  . 

The  parameters  qN  arc  to  be  chosen  from  some  admissible  parameter  set  Q. 
In  the  usual  manner  (see  Sections  1,2),  for  a  convergence  and  stability  analysis 
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I 

one  desires  to  argue  that  qN  -*  q  in  Q  implies  uN(qN)  -  u(q)  for  arbitrary 

sequences  {qN}  in  Q.  The  mode  of  convergence  in  Q  is,  of  course,  also  the  ' 

sense  in  which  we  wish  to  define  "compactness"  of  Q.  In  this 

particular  example,  we  assume  Q  "compact"  in  the  following  sense: 

. 

Assumption  C:  Any  sequence  in  Q  has  a  convergent  subsequence  {qk} 
with  limit  q  in  Q  in  the  sense 

(5.9)  |qk(t,-,v)  -  q(t,  ■  ,v)  -*  0  in  H°(0,T) 

for  every  v  e  L“(0). 

In  the  case  where  Q  consists  of  functions  of  the  form  given  in  (5.4),  it  is 
straightforward  to  translate  the  compactness  criteria  of  Assumption  C  into 
easily  verifiable  compactness  criteria  on  the  parameter  sets  A,B,r0,rj. 

Further  assumptions  (again  motivated  by  and  easily  verified  for  sets  j 

containing  functions  of  the  form  (5.4))  on  Q  are  necessary  for  convergence 
arguments  and  we  therefore  assume  the  following: 

Assumption  D:  There  is  a  constant  v  >  0  such  that  <q(t,  •  ,v)D«t/,D^>  >,  \ 

vM*  for  every  q  t  Q,  v.4,  t  H*(n). 

Assumption  E:  There  is  a  constant  >  0  such  that  |q(t,-,v)«|»|0  <  j 

MqMo  r°r  a11  v  e  L*(n^  ♦  «  H°(n),  q  e  Q.  1 


L-;. 


-31- 


Assumption  F:  There  exists  B  t  L®((0,T)  x  0)  such  that  |q(t.v  0  .  q(t,x,n)| 

<  B(t,x)|l  -  ti  |  for  all  4,n  t  R1  and  all  q  c  Q. 

We  note  that  Assumption  F  implies  existence  of  a  constant  K  such  that 

|q(t,  -,v)  -  q(t,-,u)|0  <  K|v  -  u|0  for  all  v,u  €  H  °(0). 

We  are  now  in  a  position  to  outline  convergence  arguments,  for  which  it 
suffices  (in  the  usual  manner)  to  consider  z^t)  =  uN(t)  -  PNu(t)  where  uN. 
u  ,  the  solutions  of  (5.7),  (5.8),  (5.5),  (5.6)  corresponding  to  qN,  q,  respectively, 
where  qN  -•  q  in  the  sense  of  (5.9)  given  in  Assumption  C  and  (qN)  is  an  v 
such  sequence  in  Q. 

From  (5.5)  -  (5.8)  we  obtain  z^O)  =  0  and 


(5.10) 


<z^T  ,4/>  =  <q(t, -,u)Du  -  qN(t, -,uN)DuN,  D^>  +  <(I  -  PN)ut,»j<> 


for  all  4*  €  Hn.  Choosing  =  zN  in  (5.10)  and  adopting  the  notation  q(v), 
qN(v)  for  q(t,.,v),  qN(t,  ,v)  throughout,  we  obtain 


(5.11) 


7  — I  zN  I  =  <q(u)Du  -  qN(uN)DuN,DzN>  +  <(I  -  PN)uf,zN>, 
2  dt  1  1  1 


or 


7-  |zN  1 2  +  <qN(uN)DzN,DzN> 

2  dt  1  1 


=  <q(u)Du  -  qN(uN)DPNu,DzN> 


+  <(1  -  PN)ut.zN>. 


From  Assumption  D  we  immediately  find 


(5.12) 


+  v|zN  | 2  <  <q(u)Du  -  qN(uN)DPNu,DzN>  +  -  |(1  -  PN)ut|2 


+ 


1 

2 


2 


Considering  the  first  term  on  the  right  side  of  this  inequality  we  find 
norms  arc  the  H°  norm  unless  otherwise  indicated) 


(5.13)  <q(u)Du  -  qN(uN)DPNu,DzN>  =  <(q(u)  -  qN(u))Du,DzN> 

+  <(qN(u)  -  qN(uN))Du,DzN>  +  <qN(uN)D(u  -  PNu),  DzN> 
<  i  |[Q(u)  -  qN(u)]Du|2  +  j  |DzN| 2 

+  ^  |[qN(u)  -  qN(uN)]Du|2  +  j  |  DzN  | 2 

+  i|qN(uN)D(u  -  PNu)|2  +  J|dzn|2 


The  terms  in  the  right  side  of  this  inequality  can  be  estimated  (wc 
Assumptions  E  and  F)  as  follows: 
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|(qN(u)  -  qN(uN))Du|  S  |dU |„  K  |  u  -  uN  |  S  |du|.K{|(I  -  PN)u  |  +  |zN|) 

|(q(u)  -  qN(u))Du  |  <  | Du  |  |q(u)  -  q^u)],,, 

Using  these  estimates  and  (5.13)  in  (5.12),  we  obtain 


(5.14) 


+  GN(t) 


1  -11/2  1  12 

where  p  =  -  +  Du  and 

2  V  *  |  CD 

GN(t)  =  7  |(I  -  PN)ut|2  +  — ^p*|D(u  -  PNu)|2  +  2^\du  li|(I  .  pN)u  | 2 

+  ^  |Du|2  |d(u)  -  qN(u)|l . 

The  remaining  arguments  are  similar  to  those  (integration,  Gronwall,  etc.)  of 
Section  2.  One  easily  obtains  the  convergence  z^t)  -*  0  in  H°(fi)  for  each 
t  and,  actually,  considering  again  (5.14),  the  additional  results  -»  0  in 
H°(0,T;  HJ(n)).  Of  course,  appropriate  smoothness  (e.g.  ut  e  H°([0,T]  x  O)  ,  Du 
€  H°(0,T;  L®(n))  )  assumptions  on  the  solution  u  must  be  invoked. 

We  remark  that  the  ideas  sketched  here  readily  extend  to 
multi-dimensional  domains  n  (sec  [10])  and  that  computational  efforts  based 

on  the  variational  framework  have  in  preliminary  calculations  (sec  [14])  been 


promising. 
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